Results 1 - 20 of 75
1.
Bioinformatics ; 40(3), 2024 Mar 04.
Article in English | MEDLINE | ID: mdl-38407414

ABSTRACT

MOTIVATION: Prediction and identification of core promoter elements and transcription factor binding sites is essential for understanding the mechanism of transcription initiation and deciphering the biological activity of a specific locus. Thus, there is a need for an up-to-date tool to detect and curate core promoter elements/motifs in any provided nucleotide sequences. RESULTS: Here, we introduce ElemeNT 2023, a new and enhanced version of the Elements Navigation Tool, which provides novel capabilities for assessing evolutionary conservation and for readily evaluating the quality of high-throughput transcription start site (TSS) datasets, leveraging preferential motif positioning. ElemeNT 2023 is accessible both as a fast web-based tool and via command line (no coding skills are required to run the tool). While this tool is focused on core promoter elements, it can also be used for searching any user-defined motif, including sequence-specific DNA binding sites. Furthermore, ElemeNT's CORE database, which contains predicted core promoter elements around annotated TSSs, is now expanded to cover 10 species, ranging from worms to human. In this applications note, we describe the new workflow and demonstrate a case study using ElemeNT 2023 for core promoter composition analysis of diverse species, revealing motif prevalence and highlighting evolutionary insights. We discuss how this tool facilitates the exploration of uncharted transcriptomic data, appraises TSS quality, and aids in designing synthetic promoters for gene expression optimization. Taken together, ElemeNT 2023 empowers researchers with comprehensive tools for meticulous analysis of sequence elements and gene expression strategies. AVAILABILITY AND IMPLEMENTATION: ElemeNT 2023 is freely available at https://www.juven-gershonlab.org/resources/element-v2023/. The source code and command line version of ElemeNT 2023 are available at https://github.com/OritAdato/ElemeNT.


Subjects
Software , Humans , Promoter Regions, Genetic , Protein Binding , Transcription Initiation Site
2.
Biochim Biophys Acta Gene Regul Mech ; 1865(1): 194768, 2022 01.
Article in English | MEDLINE | ID: mdl-34757206

ABSTRACT

As computational modeling becomes more essential to analyze and understand biological regulatory mechanisms, governance of the many databases and knowledge bases that support this domain is crucial to guarantee reliability and interoperability of resources. To address this, the COST Action Gene Regulation Ensemble Effort for the Knowledge Commons (GREEKC, CA15205, www.greekc.org) organized nine workshops in a four-year period, starting September 2016. The workshops brought together a wide range of experts from all over the world working on various steps in the knowledge management process that focuses on understanding gene regulatory mechanisms. The discussions between ontologists, curators, text miners, biologists, bioinformaticians, philosophers and computational scientists spawned a host of activities aimed to standardize and update existing knowledge management workflows and involve end-users in the process of designing the Gene Regulation Knowledge Commons (GRKC). Here the GREEKC consortium describes its main achievements in improving this GRKC.


Subjects
Gene Expression Regulation , Reproducibility of Results
3.
PLoS Comput Biol ; 17(8): e1009256, 2021 08.
Article in English | MEDLINE | ID: mdl-34383743

ABSTRACT

Metazoan core promoters, which direct the initiation of transcription by RNA polymerase II (Pol II), may contain short sequence motifs termed core promoter elements/motifs (e.g. the TATA box, initiator (Inr) and downstream core promoter element (DPE)), which recruit Pol II via the general transcription machinery. The DPE was discovered and extensively characterized in Drosophila, where it is strictly dependent on both the presence of an Inr and the precise spacing from it. Since the Drosophila DPE is recognized by the human transcription machinery, it is most likely that some human promoters contain a downstream element that is similar, though not necessarily identical, to the Drosophila DPE. However, only a couple of human promoters were shown to contain a functional DPE, and attempts to computationally detect human DPE-containing promoters have mostly been unsuccessful. Using a newly-designed motif discovery strategy based on Expectation-Maximization probabilistic partitioning algorithms, we discovered preferred downstream positions (PDP) in human promoters that resemble the Drosophila DPE. Available chromatin accessibility footprints revealed that Drosophila and human Inr+DPE promoter classes are not only highly structured, but also similar to each other, particularly in the proximal downstream region. Clustering of the corresponding sequence motifs using a neighbor-joining algorithm strongly suggests that canonical Inr+DPE promoters could be common to metazoan species. Using reporter assays we demonstrate the contribution of the identified downstream positions to the function of multiple human promoters. Furthermore, we show that alteration of the spacing between the Inr and PDP by two nucleotides results in reduced promoter activity, suggesting a spacing dependency of the newly discovered human PDP on the Inr. 
Taken together, our strategy identified novel functional downstream positions within human core promoters, supporting the existence of DPE-like motifs in human promoters.


Subjects
Genome, Human , Promoter Regions, Genetic , Algorithms , Animals , Base Sequence , Computational Biology , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Gene Expression Regulation , HEK293 Cells , Humans , Models, Genetic , Models, Statistical , RNA Polymerase II/metabolism , Species Specificity , TATA Box , Transcription, Genetic
4.
EMBO Mol Med ; 13(7): e14314, 2021 07 07.
Article in English | MEDLINE | ID: mdl-34042278

ABSTRACT

Hormonal contraception exposes women to synthetic progesterone receptor (PR) agonists, progestins, and transiently increases breast cancer risk. How progesterone and progestins affect the breast epithelium is poorly understood because we lack adequate models to study this. We hypothesized that individual progestins differentially affect breast epithelial cell proliferation and hence breast cancer risk. Using mouse mammary tissue ex vivo, we show that testosterone-related progestins induce the PR target and mediator of PR signaling-induced cell proliferation receptor activator of NF-κB ligand (Rankl), whereas progestins with anti-androgenic properties in reporter assays do not. We develop intraductal xenografts of human breast epithelial cells from 36 women, show they remain hormone-responsive and that progesterone and the androgenic progestins, desogestrel, gestodene, and levonorgestrel, promote proliferation but the anti-androgenic, chlormadinone, and cyproterone acetate, do not. Prolonged exposure to androgenic progestins elicits hyperproliferation with cytologic changes. Androgen receptor inhibition interferes with PR agonist- and levonorgestrel-induced RANKL expression and reduces levonorgestrel-driven cell proliferation. Thus, different progestins have distinct biological activities in the breast epithelium to be considered for more informed choices in hormonal contraception.


Subjects
Androgens , Progestins , Animals , Cell Proliferation , Contraceptive Agents , Mice
5.
EMBO Mol Med ; 13(3): e13180, 2021 03 05.
Article in English | MEDLINE | ID: mdl-33616307

ABSTRACT

Invasive lobular carcinoma (ILC) is the most frequent special histological subtype of breast cancer, typically characterized by loss of E-cadherin. It has clinical features distinct from other estrogen receptor-positive (ER+) breast cancers but the molecular mechanisms underlying its characteristic biology are poorly understood because we lack experimental models to study them. Here, we recapitulate the human disease, including its metastatic pattern, by grafting ILC-derived breast cancer cell lines, SUM-44 PE and MDA-MB-134-VI cells, into the mouse milk ducts. Using patient-derived intraductal xenografts from lobular and non-lobular ER+ HER2- tumors to compare global gene expression, we identify extracellular matrix modulation as a lobular carcinoma cell-intrinsic trait. Analysis of TCGA patient datasets shows that a matrisome signature is enriched in lobular carcinomas with overexpression of elastin, collagens, and the collagen-modifying enzyme LOXL1. Treatment with the pan-LOX inhibitor BAPN and silencing of LOXL1 expression decrease tumor growth, invasion, and metastasis by disrupting ECM structure, resulting in decreased ER signaling. We conclude that LOXL1 inhibition is a promising therapeutic strategy for ILC.


Subjects
Breast Neoplasms , Carcinoma, Lobular , Amino Acid Oxidoreductases/genetics , Animals , Carcinoma, Lobular/genetics , Extracellular Matrix , Female , Heterografts , Humans , Mice , Receptors, Estrogen
6.
Genome Biol ; 21(1): 114, 2020 05 11.
Article in English | MEDLINE | ID: mdl-32393327

ABSTRACT

BACKGROUND: Positional weight matrix (PWM) is a de facto standard model to describe transcription factor (TF) DNA binding specificities. PWMs inferred from in vivo or in vitro data are stored in many databases and used in a plethora of biological applications. This calls for comprehensive benchmarking of public PWM models with large experimental reference sets. RESULTS: Here we report results from all-against-all benchmarking of PWM models for DNA binding sites of human TFs on a large compilation of in vitro (HT-SELEX, PBM) and in vivo (ChIP-seq) binding data. We observe that the best performing PWM for a given TF often belongs to another TF, usually from the same family. Occasionally, binding specificity is correlated with the structural class of the DNA binding domain, indicated by good cross-family performance measures. Benchmarking-based selection of family-representative motifs is more effective than motif clustering-based approaches. Overall, there is good agreement between in vitro and in vivo performance measures. However, for some in vivo experiments, the best performing PWM is assigned to an unrelated TF, indicating a binding mode involving protein-protein cooperativity. CONCLUSIONS: In an all-against-all setting, we compute more than 18 million performance measure values for different PWM-experiment combinations and offer these results as a public resource to the research community. The benchmarking protocols are provided via a web interface and as docker images. The methods and results from this study may help others make better use of public TF specificity models, as well as public TF binding data sets.


Subjects
Protein Interaction Domains and Motifs , Software , Transcription Factors/metabolism , Animals , Benchmarking , Chromatin Immunoprecipitation Sequencing , Humans , Mice
7.
Nat Commun ; 11(1): 1571, 2020 03 26.
Article in English | MEDLINE | ID: mdl-32218432

ABSTRACT

Estrogens and progesterone control breast development and carcinogenesis via their cognate receptors expressed in a subset of luminal cells in the mammary epithelium. How they control the extracellular matrix, important to breast physiology and tumorigenesis, remains unclear. Here we report that both hormones induce the secreted protease Adamts18 in myoepithelial cells by controlling Wnt4 expression with consequent paracrine canonical Wnt signaling activation. Adamts18 is required for stem cell activation, has multiple binding partners in the basement membrane and interacts genetically with the basal membrane-specific proteoglycan, Col18a1, pointing to the basement membrane as part of the stem cell niche. In vitro, ADAMTS18 cleaves fibronectin; in vivo, Adamts18 deletion causes increased collagen deposition during puberty, which results in impaired Hippo signaling and reduced Fgfr2 expression both of which control stem cell function. Thus, Adamts18 links luminal hormone receptor signaling to basement membrane remodeling and stem cell activation.


Subjects
ADAMTS Proteins/metabolism , Hormones/pharmacology , Mammary Glands, Animal/cytology , Stem Cell Niche , Stem Cells/metabolism , ADAMTS Proteins/deficiency , ADAMTS Proteins/genetics , Animals , Antigens, CD/metabolism , Cell Line , Cell Self Renewal/drug effects , Epithelium/metabolism , Extracellular Matrix/drug effects , Extracellular Matrix/metabolism , Female , Fibronectins/metabolism , Glycoproteins/metabolism , Humans , Mice, Inbred C57BL , Models, Biological , RNA, Messenger/genetics , RNA, Messenger/metabolism , Receptors, Progesterone/metabolism , Regeneration/drug effects , Signal Transduction/drug effects , Stem Cell Niche/drug effects , Stem Cells/cytology , Stem Cells/drug effects , Transcription, Genetic/drug effects
8.
Nucleic Acids Res ; 48(D1): D65-D69, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31680159

ABSTRACT

The Eukaryotic Promoter Database (EPD), available online at https://epd.epfl.ch, provides accurate transcription start site (TSS) information for promoters of 15 model organisms plus corresponding functional genomics data that can be viewed in a genome browser, queried or analyzed via web interfaces, or exported in standard formats (FASTA, BED, CSV) for subsequent analysis with other tools. Recent work has focused on the improvement of the EPD promoter viewers, which use the UCSC Genome Browser as visualization platform. Thousands of high-resolution tracks for CAGE, ChIP-seq and similar data have been generated and organized into public track hubs. Customized, reproducible promoter views, combining EPD-supplied tracks with native UCSC Genome Browser tracks, can be accessed from the organism summary pages or from individual promoter entries. Moreover, thanks to recent improvements and stabilization of ncRNA gene catalogs, we were able to release promoter collections for certain classes of ncRNAs from human and mouse. Furthermore, we developed automatic computational protocols to assign orphan TSS peaks to downstream genes based on paired-end (RAMPAGE) TSS mapping data, which enabled us to add nearly 9000 new entries to the human promoter collection. Since our last article in this journal, EPD was extended to five more model organisms: rhesus monkey, rat, dog, chicken and Plasmodium falciparum.


Subjects
Computational Biology/methods , Databases, Nucleic Acid , Eukaryotic Cells/metabolism , Genomics/methods , Promoter Regions, Genetic , RNA, Untranslated , Animals , Humans , Software , Web Browser
9.
Sci Rep ; 9(1): 18464, 2019 12 05.
Article in English | MEDLINE | ID: mdl-31804560

ABSTRACT

Parkinson disease (PD) is characterized by a pivotal progressive loss of substantia nigra dopaminergic neurons and aggregation of α-synuclein protein encoded by the SNCA gene. Genome-wide association studies identified almost 100 sequence variants linked to PD in SNCA. However, the consequences of this genetic variability are rather unclear. Herein, our analysis of selected single nucleotide polymorphisms (SNPs) that are highly associated with PD susceptibility revealed that several SNP sites attribute to the nucleosomes and overlay with bivalent regions poised to adopt either active or repressed chromatin states. We also identified a large number of transcription factor (TF) binding sites associated with these variants. In addition, we located two docking sites in the intron-1 methylation-prone region of SNCA which are required for the putative interactions with DNMT1. Taken together, our analysis reflects an additional layer of epigenomic contribution to the regulation of the SNCA gene in PD.


Subjects
Epigenesis, Genetic , Parkinson Disease/genetics , alpha-Synuclein/genetics , Binding Sites/genetics , Chromatin/metabolism , DNA (Cytosine-5-)-Methyltransferase 1/metabolism , DNA Methylation , Datasets as Topic , Dopaminergic Neurons/metabolism , Dopaminergic Neurons/pathology , Genetic Predisposition to Disease , Genome-Wide Association Study , Histones/metabolism , Humans , Introns/genetics , Nucleosomes/metabolism , Parkinson Disease/pathology , Polymorphism, Single Nucleotide , Protein Binding/genetics , Substantia Nigra/cytology , Substantia Nigra/metabolism , Substantia Nigra/pathology , alpha-Synuclein/metabolism
10.
Nat Struct Mol Biol ; 26(8): 744-754, 2019 08.
Article in English | MEDLINE | ID: mdl-31384063

ABSTRACT

Precise nucleosome organization at eukaryotic promoters is thought to be generated by multiple chromatin remodeler (CR) enzymes and to affect transcription initiation. Using an integrated analysis of chromatin remodeler binding and nucleosome occupancy following rapid remodeler depletion, we investigated the interplay between these enzymes and their impact on transcription in yeast. We show that many promoters are affected by multiple CRs that operate in concert or in opposition to position the key transcription start site (TSS)-associated +1 nucleosome. We also show that nucleosome movement after CR inactivation usually results from the activity of another CR and that in the absence of any remodeling activity, +1 nucleosomes largely maintain their positions. Finally, we present functional assays suggesting that +1 nucleosome positioning often reflects a trade-off between maximizing RNA polymerase recruitment and minimizing transcription initiation at incorrect sites. Our results provide a detailed picture of fundamental mechanisms linking promoter nucleosome architecture to transcription initiation.


Subjects
Chromatin Assembly and Disassembly/physiology , Saccharomyces cerevisiae/genetics , Transcription Initiation Site , Transcription Initiation, Genetic/physiology , Chromatin Assembly and Disassembly/genetics , DNA, Fungal/genetics , DNA, Intergenic/genetics , DNA, Intergenic/metabolism , Macromolecular Substances/metabolism , Micrococcal Nuclease/metabolism , Nucleosomes/metabolism , Saccharomyces cerevisiae/enzymology , Saccharomyces cerevisiae Proteins/metabolism
11.
Bioinformatics ; 35(21): 4440-4441, 2019 11 01.
Article in English | MEDLINE | ID: mdl-31116370

ABSTRACT

SUMMARY: We present SPar-K (Signal Partitioning with K-means), a method to search for archetypical chromatin architectures by partitioning a set of genomic regions characterized by chromatin signal profiles around ChIP-seq peaks and other kinds of functional sites. This method efficiently deals with problems of data heterogeneity, limited misalignment of anchor points and unknown orientation of asymmetric patterns. AVAILABILITY AND IMPLEMENTATION: SPar-K is a C++ program available on GitHub https://github.com/romaingroux/SPar-K and Docker Hub https://hub.docker.com/r/rgroux/spar-k. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Software , Chromatin , Chromatin Immunoprecipitation , Genome , Genomics
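The SPar-K abstract above mentions tolerating small misalignments of anchor points and unknown orientation when comparing chromatin signal profiles. The following is a minimal sketch of that idea only, not the SPar-K implementation (which is a C++ program and may use a different distance and windowing scheme). The function names, the Euclidean distance, and the toy profiles are all assumptions made for illustration.

```python
def euclidean(a, b):
    """Euclidean distance between two equal-length signal vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def best_alignment(profile, reference, max_shift=2):
    """Return (distance, shift, flipped) for the best match of profile to reference.

    Hypothetical helper: the central core of `reference` is compared with
    every window of `profile` shifted by up to `max_shift` bins, once in the
    given orientation and once reversed.
    """
    n = len(reference)
    if len(profile) != n or n <= 2 * max_shift:
        raise ValueError("profiles must have equal length greater than 2 * max_shift")
    core = reference[max_shift:n - max_shift]
    best = None
    for flipped in (False, True):
        candidate = profile[::-1] if flipped else profile
        for start in range(2 * max_shift + 1):
            window = candidate[start:start + len(core)]
            result = (euclidean(window, core), start - max_shift, flipped)
            if best is None or result[0] < best[0]:
                best = result
    return best


# Toy data: `profile` is `reference` reversed and displaced by one bin.
reference = [0, 0, 1, 3, 9, 0, 0, 0]
profile = [0, 0, 9, 3, 1, 0, 0, 0]
print(best_alignment(profile, reference))  # (0.0, 1, True)
```

In a K-means style partitioning, a distance of this kind would be used in the assignment step, and each region would be shifted and flipped accordingly before the cluster averages are updated.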
12.
PLoS One ; 13(11): e0206823, 2018.
Article in English | MEDLINE | ID: mdl-30418981

ABSTRACT

Regulation of mRNA stability by RNA-protein interactions contributes significantly to quantitative aspects of gene expression. We have identified potential mRNA targets of the AU-rich element binding protein AUF1. Myc-tagged AUF1 p42 was induced in mouse NIH/3T3 cells and RNA-protein complexes isolated using anti-myc tag antibody beads. Bound mRNAs were analyzed with Affymetrix microarrays. We have identified 508 potential target mRNAs that were at least 3-fold enriched compared to control cells without myc-AUF1. 22.3% of the enriched mRNAs had an AU-rich cluster in the ARED Organism database, against 16.3% of non-enriched control mRNAs. The enrichment towards AU-rich elements was also visible by AREScore with an average value of 5.2 in the enriched mRNAs versus 4.2 in the control group. Yet, numerous mRNAs were enriched without a high ARE score. The enrichment of tetrameric and pentameric sequences suggests a broad AUF1 p42-binding spectrum at short U-rich sequences flanked by A or G. Still, some enriched mRNAs were highly unstable, as those of TNFSF11 (known as RANKL), KLF10, HES1, CCNT2, SMAD6, and BCL6. We have mapped some of the instability determinants. HES1 mRNA appeared to have a coding region determinant. Detailed analysis of the RANKL and BCL6 3'UTR revealed for both that full instability required two elements, which are conserved in evolution. In RANKL mRNA both elements are AU-rich and separated by 30 bases, while in BCL6 mRNA one is AU-rich and 60 bases from a non AU-rich element that potentially forms a stem-loop structure.


Subjects
Heterogeneous-Nuclear Ribonucleoprotein D/metabolism , Proto-Oncogene Proteins c-bcl-6/genetics , RANK Ligand/genetics , RNA Stability/genetics , 3' Untranslated Regions/genetics , AU Rich Elements/genetics , Animals , Binding Sites/genetics , HEK293 Cells , Heterogeneous Nuclear Ribonucleoprotein D0 , Heterogeneous-Nuclear Ribonucleoprotein D/genetics , Humans , Mice , NIH 3T3 Cells , Oligonucleotide Array Sequence Analysis , Protein Isoforms/genetics , Protein Isoforms/metabolism , Proto-Oncogene Proteins c-bcl-6/metabolism , RANK Ligand/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism
13.
PeerJ ; 6: e5362, 2018.
Article in English | MEDLINE | ID: mdl-30083469

ABSTRACT

To detect functional somatic mutations in tumor samples, whole-exome sequencing (WES) is often used for its reliability and relatively low cost. RNA-seq, while generally used to measure gene expression, can potentially also be used for identification of somatic mutations. However, there has been little systematic evaluation of the utility of RNA-seq for identifying somatic mutations. Here, we develop and evaluate a pipeline for processing RNA-seq data from glioblastoma multiforme (GBM) tumors in order to identify somatic mutations. The pipeline entails the use of the STAR aligner 2-pass procedure jointly with MuTect2 from the Genome Analysis Toolkit (GATK) to detect somatic variants. Variants identified from RNA-seq data were evaluated by comparison against the COSMIC and dbSNP databases, and also compared to somatic variants identified by exome sequencing. We also estimated the putative functional impact of coding variants in the most frequently mutated genes in GBM. Interestingly, variants identified by RNA-seq alone showed better representation of GBM-related mutations cataloged by COSMIC. RNA-seq-only data substantially outperformed the ability of WES to reveal potentially new somatic mutations in known GBM-related pathways, and allowed us to build a high-quality set of somatic mutations common to exome and RNA-seq calls. Using RNA-seq data in parallel with WES data to detect somatic mutations in cancer genomes can thus broaden the scope of discoveries and lend additional support to somatic variants identified by exome sequencing alone.

14.
Bioinformatics ; 34(14): 2483-2484, 2018 07 15.
Article in English | MEDLINE | ID: mdl-29514181

ABSTRACT

Summary: Transcription factors regulate gene expression by binding to specific short DNA sequences of 5-20 bp to regulate the rate of transcription of genetic information from DNA to messenger RNA. We present PWMScan, a fast web-based tool to scan server-resident genomes for matches to a user-supplied PWM or transcription factor binding site model from a public database. Availability and implementation: The web server and source code are available at http://ccg.vital-it.ch/pwmscan and https://sourceforge.net/projects/pwmscan, respectively. Supplementary information: Supplementary data are available at Bioinformatics online.


Subjects
Genomics/methods , Position-Specific Scoring Matrices , Regulatory Sequences, Nucleic Acid , Software , Transcription Factors/metabolism , DNA/metabolism , Humans , Protein Binding
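Scanning a sequence for matches to a position weight matrix, the task the PWMScan abstract above describes, can be illustrated with a short sketch. This is a generic log-odds scan written for illustration and is not PWMScan's code or interface. The toy count matrix, the pseudocount of 1, the uniform background of 0.25, and all function names are assumptions.

```python
import math

BASES = "ACGT"

# Hypothetical toy counts for a 4-bp TATA-like motif (one dict per position).
TOY_COUNTS = [
    {"A": 0, "C": 0, "G": 0, "T": 10},
    {"A": 10, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 0, "T": 10},
    {"A": 10, "C": 0, "G": 0, "T": 0},
]


def log_odds_pwm(counts, pseudocount=1.0, background=0.25):
    """Convert per-position base counts into log2-odds scores."""
    pwm = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        pwm.append({b: math.log2((column[b] + pseudocount) / total / background)
                    for b in BASES})
    return pwm


def reverse_complement(seq):
    """Reverse-complement a DNA string; non-ACGT characters are left unchanged."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan(pwm, seq, threshold):
    """Return (start, strand, score) for every window scoring at least threshold.

    `start` is always the 0-based position on the forward strand.
    """
    seq = seq.upper()
    width = len(pwm)
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for i in range(len(s) - width + 1):
            window = s[i:i + width]
            if any(b not in BASES for b in window):
                continue  # skip windows containing N or other ambiguity codes
            score = sum(col[base] for col, base in zip(pwm, window))
            if score >= threshold:
                start = i if strand == "+" else len(seq) - i - width
                hits.append((start, strand, score))
    return hits


pwm = log_odds_pwm(TOY_COUNTS)
for start, strand, score in scan(pwm, "GGTATACC", threshold=6.0):
    print(start, strand, round(score, 2))
```

Because TATA is its own reverse complement, the toy sequence yields one hit on each strand at the same forward-strand position. A genome-scale tool would additionally need indexing or a compiled scanner to be fast, and the sketch makes no attempt at that.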
15.
Nucleic Acids Res ; 46(D1): D175-D180, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29069466

ABSTRACT

The Mass Genome Annotation (MGA) repository is a resource designed to store published next generation sequencing data and other genome annotation data (such as gene start sites, SNPs, etc.) in a completely standardised format. Each sample has undergone local processing in order to meet the strict MGA format requirements. The original data source, the reformatting procedure and the biological characteristics of the samples are described in an accompanying documentation file manually edited by data curators. Ten model organisms are currently represented: Homo sapiens, Mus musculus, Danio rerio, Drosophila melanogaster, Apis mellifera, Caenorhabditis elegans, Arabidopsis thaliana, Zea mays, Saccharomyces cerevisiae and Schizosaccharomyces pombe. As of today, the resource contains over 24 000 samples. In conjunction with other tools developed by our group (the ChIP-Seq and SSA servers), it allows users to carry out a great variety of analysis tasks with MGA samples, such as making aggregation plots and heat maps for selected genomic regions, finding peak regions, generating custom tracks for visualizing genomic features in a UCSC genome browser window, or downloading chromatin data in a table format suitable for local processing with more advanced statistical analysis software such as R. Home page: http://ccg.vital-it.ch/mga/.


Subjects
Databases, Nucleic Acid , Animals , Chromatin Immunoprecipitation , Data Curation , High-Throughput Nucleotide Sequencing , Humans , Internet , Molecular Sequence Annotation , Search Engine
16.
Nat Methods ; 14(3): 316-322, 2017 03.
Article in English | MEDLINE | ID: mdl-28092692

ABSTRACT

Resolving the DNA-binding specificities of transcription factors (TFs) is of critical value for understanding gene regulation. Here, we present a novel, semiautomated protein-DNA interaction characterization technology, selective microfluidics-based ligand enrichment followed by sequencing (SMiLE-seq). SMiLE-seq is neither limited by DNA bait length nor biased toward strong affinity binders; it probes the DNA-binding properties of TFs over a wide affinity range in a fast and cost-effective fashion. We validated SMiLE-seq by analyzing 58 full-length human, mouse, and Drosophila TFs from distinct structural classes. All tested TFs yielded DNA-binding models with predictive power comparable to or greater than that of other in vitro assays. De novo motif discovery on all JUN-FOS heterodimers and several nuclear receptor-TF complexes provided novel insights into partner-specific heterodimer DNA-binding preferences. We also successfully analyzed the DNA-binding properties of uncharacterized human C2H2 zinc-finger proteins and validated several using ChIP-exo.


Subjects
CYS2-HIS2 Zinc Fingers/physiology , DNA-Binding Proteins/metabolism , DNA/metabolism , JNK Mitogen-Activated Protein Kinases/metabolism , Proto-Oncogene Proteins c-fos/metabolism , Transcription Factors/metabolism , Animals , Binding Sites/genetics , Computational Biology , Drosophila/genetics , Gene Expression Regulation , High-Throughput Nucleotide Sequencing/methods , Humans , JNK Mitogen-Activated Protein Kinases/genetics , Mice , Microfluidics/methods , Proto-Oncogene Proteins c-fos/genetics , Sequence Analysis, DNA/methods
17.
Nucleic Acids Res ; 45(D1): D139-D144, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899579

ABSTRACT

SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/.


Subjects
Binding Sites , Computational Biology/methods , Databases, Nucleic Acid , Polymorphism, Single Nucleotide , Transcription Factors , Algorithms , Genome, Human , Genomics/methods , Humans , Protein Binding , Transcription Factors/metabolism , Web Browser
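The SNP2TFBS abstract above says a SNP's effect on TF binding is estimated from a PWM model. One common way to do this is to compare the best PWM score over all windows covering the variant for the reference and the alternate allele. The sketch below illustrates that idea under stated assumptions and is not the SNP2TFBS procedure itself: the PWM is a hypothetical matrix built from a consensus string, only the forward strand is scored, and every name and parameter is invented for illustration.

```python
import math

BASES = "ACGT"


def make_pwm(consensus, match=0.85, background=0.25):
    """Build a hypothetical log2-odds PWM that favours `consensus` at each position."""
    mismatch = (1.0 - match) / 3
    return [{b: math.log2((match if b == c else mismatch) / background) for b in BASES}
            for c in consensus]


def best_overlapping_score(pwm, seq, snp_pos):
    """Best forward-strand PWM score among windows that cover position snp_pos (0-based)."""
    width = len(pwm)
    first = max(0, snp_pos - width + 1)
    last = min(snp_pos, len(seq) - width)
    best = -math.inf
    for start in range(first, last + 1):
        window = seq[start:start + width]
        best = max(best, sum(col[base] for col, base in zip(pwm, window)))
    return best


def snp_effect(pwm, ref_seq, snp_pos, alt_base):
    """Return (ref_score, alt_score, delta); a negative delta suggests weakened binding."""
    alt_seq = ref_seq[:snp_pos] + alt_base + ref_seq[snp_pos + 1:]
    ref_score = best_overlapping_score(pwm, ref_seq, snp_pos)
    alt_score = best_overlapping_score(pwm, alt_seq, snp_pos)
    return ref_score, alt_score, alt_score - ref_score


# Toy example: an A>G change inside a TGACTCA-like site.
pwm = make_pwm("TGACTCA")
ref_score, alt_score, delta = snp_effect(pwm, "CCTGACTCAGG", snp_pos=4, alt_base="G")
print(round(ref_score, 2), round(alt_score, 2), round(delta, 2))
```

Classifying a variant as abolishing, creating, or changing a site would additionally require a score cutoff per matrix, and that choice is left out of the sketch.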
18.
Nucleic Acids Res ; 45(D1): D51-D55, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899657

ABSTRACT

We present an update of the Eukaryotic Promoter Database EPD (http://epd.vital-it.ch), more specifically on the EPDnew division, which contains comprehensive organism-specific transcription start site (TSS) collections automatically derived from next generation sequencing (NGS) data. Thanks to the abundant release of new high-throughput transcript mapping data (CAGE, TSS-seq, GRO-cap) the database could be extended to plant and fungal species. We further report on the expansion of the mass genome annotation (MGA) repository containing promoter-relevant chromatin profiling data and on improvements for the EPD entry viewers. Finally, we present a new data access tool, ChIP-Extract, which enables computational biologists to extract diverse types of promoter-associated data in numerical table formats that are readily imported into statistical analysis platforms such as R.


Subjects
Databases, Nucleic Acid , Promoter Regions, Genetic , Animals , Eukaryota/genetics , Fungi/genetics , Humans , Plants/genetics , Transcription Initiation Site
19.
BMC Genomics ; 17(1): 938, 2016 11 18.
Article in English | MEDLINE | ID: mdl-27863463

ABSTRACT

BACKGROUND: ChIP-seq and related high-throughput chromatin profiling assays generate ever-increasing volumes of highly valuable biological data. To make sense out of it, biologists need versatile, efficient and user-friendly tools for access, visualization and integrative analysis of such data. RESULTS: Here we present the ChIP-Seq command line tools and web server, implementing basic algorithms for ChIP-seq data analysis starting with a read alignment file. The tools are optimized for memory-efficiency and speed thus allowing for processing of large data volumes on inexpensive hardware. The web interface provides access to a large database of public data. The ChIP-Seq tools have a modular and interoperable design in that the output from one application can serve as input to another one. Complex and innovative tasks can thus be achieved by running several tools in a cascade. CONCLUSIONS: The various ChIP-Seq command line tools and web services either complement or compare favorably to related bioinformatics resources in terms of computational efficiency, ease of access to public data and interoperability with other web-based tools. The ChIP-Seq server is accessible at http://ccg.vital-it.ch/chipseq/.


Subjects
Chromatin Immunoprecipitation , Computational Biology/methods , Genomics/methods , High-Throughput Nucleotide Sequencing , Software , Web Browser , Molecular Sequence Annotation , User-Computer Interface
20.
PLoS Comput Biol ; 12(10): e1005144, 2016 Oct.
Article in English | MEDLINE | ID: mdl-27716823

ABSTRACT

The recruitment of RNA-Pol-II to the transcription start site (TSS) is an important step in gene regulation in all organisms. Core promoter elements (CPE) are conserved sequence motifs that guide Pol-II to the TSS by interacting with specific transcription factors (TFs). However, only a minority of animal promoters contain CPEs. It is still unknown how Pol-II selects the TSS in their absence. Here we present a comparative analysis of promoters' sequence composition and chromatin architecture in five eukaryotic model organisms, which shows the presence of common and unique DNA-encoded features used to organize chromatin. Analysis of Pol-II initiation patterns uncovers that, in the absence of certain CPEs, there is a strong correlation between the spread of initiation and the intensity of the 10 bp periodic signal in the nearest downstream nucleosome. Moreover, promoters' primary and secondary initiation sites show a characteristic 10 bp periodicity in the absence of CPEs. We also show that DNA natural variants in the region immediately downstream of the TSS are able to affect both the nucleosome-DNA affinity and Pol-II initiation pattern. These findings support the notion that, in addition to CPE-mediated selection, sequence-induced nucleosome positioning could be a common and conserved mechanism of TSS selection in animals.


Subjects
DNA/genetics , Nucleosomes/genetics , Promoter Regions, Genetic/genetics , RNA Polymerase II/genetics , Transcription Initiation Site/physiology , Transcription, Genetic/genetics , Base Sequence , Binding Sites , Computer Simulation , Models, Genetic , Molecular Sequence Data , Transcriptional Activation/genetics